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Abstract — Low-density parity-check (LPDC) decoders assume 
the channel estate information (CSI) is Isnown and they have the 
true a posteriori probability (APP) for each transmitted bit. But 
in most cases of interest, the CSI needs to be estimated with the 
help of a short training sequence and the LDPC decoder has 
to decode the received word using faulty APP estimates. In this 
paper, we study the uncertainty in the CSI estimate and how it 
affects the bit error rate (BER) output by the LDPC decoder. To 
improve these APP estimates, we propose a Bayesian equalizer 
that takes into consideration not only the uncertainty due to the 
noise in the channel, but also the uncertainty in the CSI estimate, 
reducing the BER after the LDPC decoder. 

I. Introduction 

Single input single output (SISO) communication channels 
can be characterized by a linear finite impulsive response that 
either represents the dispersive nature of a physical media 
or the multiple paths of wireless communications |[l]. This 
representation causes inter-symbol interference (ISl) at the 
receiver end that can impair the digital communication. Given 
this channel estate information (CSI), the maximum likelihood 
sequence detector (MLSD) |2| -Viterbi Algorithm- provides 
the optimal transmitted sequence at the receiver end. And, if 
we are interested in a posteriori probabilities (APP) for the 
transmitted symbols, we can use the BCJR algorithm (3], that 
provides bitwise optimal decisions. 

Channel encoders introduce controlled redundancy in the 
transmitted sequence to correct errors caused by the channel. 
Modern channel decoders, such as turbo or low-density parity- 
check (LDPC) codes ID, need accurate APP estimates to be 
able to achieve channel capacity f5\. 

In a previous work, we have shown that accurate APP 
estimates increase the performance of LDPC decoders |6|, 
although that work focuses on nonlinear channel estimation. 

In this paper, we study how the uncertainty in the estimation 
of the CSI affects the optimal performance of modern channel 
decoders. The CSI is acquired using a training sequence 1 1 1 
and it is typically estimated by maximum likelihood (ML). 
These training sequences are necessary short to reduce the 
transmission of non-informative symbols, yielding inaccurate 
CSI estimates. Thus, the BCJR assuming an ML estimation 
(hereafter refers as ML-BCJR) only delivers an approximation 
to the APP for each symbol because it does not include the 
uncertainty in the estimate. Inaccuracies in the APP estimates 
degrade the performance of channel decoders for turbo or 
LDPC codes 17], 18J. We consecutively propose and analyze a 



Bayesian equalizer, which takes into account the uncertainty 
in the CSI estimate to produce more accurate APP estimates. 

The difference between the bit error rate (BER) of the ML- 
BCJR and the proposed Bayesian equalizer is not significant, 
although it slightly favors the Bayesian equalizer However, 
this is not an accurate measure of the quality of the APP 
estimates for each equalizer since it only considers hard deci- 
sions, in contrast to the soft inputs needed by modern channel 
decoders. Thus, assuming LDPC coding in our communication 
system, we can compare the quality of the APP estimates for 
each equalizer. We experimentally show that at the output 
of the LDPC decoder the Bayesian equalizer considerably 
improves the performance of the ML-BCJR equalizer, when 
we measure the probability of error These gains are more 
significant for high signal to noise ratios, channels with long 
impulsive responses and/or short training sequences. 

There are some related works where uncertainties are ex- 
ploited on the estimation of the transmitted symbols. In the 
framework of turbo-receivers |9|, some approaches can be 
found in the literature that incorporate these uncertainties in 
the iterative process of equalization and decoding. It has been 
exploited in ifTOl . where the authors use an MMSE to estimate 
the channel, and in fTT], flT], where they do not focus on the 
optimal estimation of the APP. In ||13| we find a proposal to 
estimate some parameters in a OFDM system to later include 
them in the decoding. In fT4l, the authors consider the channel 
estimation inaccuracies during the decoding process by means 
of a practical decoding metric. 

The paper is organized as follows. In Section|ll]we describe 
the structure of a SISO communication system and the ML- 
BCJR solution. The proposed Bayesian equalization technique 



is presented in Section III Experimental results in Section 
IV help to illustrate the performance of our method. Finally, 
Section|V]ends with conclusions and some proposals for future 
work. 

II. ML-BCJR Equalization 

We consider the discrete-time dispersive communication 
system depicted in Fig. [T] The channel H{z) is completely 
specified by the CSI, i.e., h = [hi, /12, • • ■ , /il]^, where L is 
the length of the channel. We model the values of the channel 
h as independent Gaussians with zero-mean and variance 
equal to 1/i (Rayleigh fading). 
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Fig. 1. System model. 



A block of K message bits, m = [mi,TO2, . . . jUij^] , is 
encoded with a rate R = K/N to obtain the codeword b = 
[5i, 62, . . . , ^at]^ that is transmitted over the channel using a 
BPSK modulation: 



b7h 



Wi 



(1) 



where hi — [5^, . . ., 6i_L+i]^ and Wi is additive white 
Gaussian noise (AWGN) with variance ct^. Thus, the received 
sequence is x = [xi,X2, ■ ■ ■ , a^Af]^- 

At the beginning of every block we transmit a pream- 
ble with n known bits (6°,..., 6°) and the receiver uses 
"D = {a;°, 6°}"^]^, the training sequence, to estimate the 
channel. The ML criterion is widely considered for the task 
of estimation; 



Hml = argmaxp(x°|b°,h). 

h 



(2) 



Once we have estimated the channel coefficients with the 
preamble, we apply the BCJR algorithm to obtain the posterior 
probability estimates for each transmitted bit: 



= 6|x, hAfi) 



1,2... TV. 



(3) 



Finally we decode the received word using the LDPC 
decoder to obtain a maximum a posteriori estimate for m^. 

The LDPC decoder relies on the estimates in (j3]l being 
accurate and, if they are not, the decoding might not finish 
or might return an incorrect codeword. In Fig. [2] we show 
p{bi = l|x, Hml) versus the probability p(bi = l|x, h) for 
one estimation of the fading channel h = [1,0.1] obtained 
assuming n — 6, and a signal to noise ratio (SNR) equal to 
dB. We can see in Fig. |2]that the predictions for each bit 
are quite accurate. But these posterior probability estimates 
are overconfident roughly half of the time, because ML is an 
unbiased estimator of the CSL which could derail the LDPC 
decoder, because a bit with high confidence for a zero or a 
one is hard to overrule if it is incorrect. 

in. Bayesian Equalization 

If we provide the BCJR with the true CSI we have the 
exact APP |3|. However, in practice we usually use the ML 
criterion in (j2]) to estimate the CSI from a training sequence. 
This estimated CSI, provided as ground truth to the BCJR 
algorithm, assigns inaccurate estimates of the APP and might 
mislead the channel decoder to deliver the incorrect codeword 
(or not to converge at all). We propose a Bayesian equalizer 
that takes into account both the uncertainty about the noise 
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2. Calibration curve for the ML-BCJR assuming SNR = dB and 
[1,0.1]. 



and the uncertainty about the CSI estimate. We compute the 
posterior probability as: 

p{h = 6|x, T^)= J Pih = 6|x, h)p{h\V)dh, (4) 



where & = ±1 and 



p{h\V) 



_ p(hMx°|b°,h) 

p(x°|b°) 
'p{xi°,...,Xn°\bi' 



(5) 



is the posterior probability for the CSI, given the likelihood 
(Gaussian noise) and the prior (e.g. Rayleigh fading). 

The marginalization of h in (j4]) provides equal or better 
APP estimates that the ML-BCJR, because it includes all the 
information of p(h|I?). In fact, if the training sequence is large 
enough, this Gaussian posterior tends to a multidimensional 
delta centered at its mean, whose value tends to the real value 
of h, as IiAfL does. However, the performance of the ML- 
BCJR is misled in case of uncertainty in the CSI. Conversely, 
the Bayesian equalizer in (|4]) considers the uncertainty in the 
estimation, providing more accurate APP estimates. 

This computation of the APP does not yield a significant 
improvement in the BER after the equalizer, because for 



detection we only consider if the probability is lower or higher 
than 0.5. This is illustrated in Section ITV] 

However, the main advantage of the Bayesian equalizer is 
that the performance of the soft-decoder improves with this 
new estimation of the probabilities, that translates into better 
results in terms of BER at the output of the decoder. In 
Fig. [3] we show p{bi — l|x, 2?) versus p{bi — l|x, h) for 
a fading channel h = [1, 0.1] estimated with the same training 
sequences assumed in Fig |2] Compared to the ML-BCJR in 
Fig.[2j in the Bayesian equalizer the zero-mean Gaussian prior 
tends to underestimate the CSl. Then, the averaging over 
all possibles values of j3(h|2?) gives mostly underconfident 
posterior probability estimates, as we can observe in Fig. [3] 
Therefore, the LDPC decoder does not have such a strong 
preference for the received bits and they can be easily flipped, 
if necessary. 
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Fig. 3. Calibration curve for tlie Bayesian equalizer assuming SNR = dB 
and h = [1,0.1]. 

A. Computation of the solution 

We propose to use Monte Carlo to obtain an approximate 
result to Q considering the following steps: 

1) Calculate the posterior of the channel: in (j5]l the nu- 
merator is the product of the likelihood p(x°|b°,h) 
and the prior of h. Considering a system with BPSK 
modulation in a Rayleigh channel, both terms are real 
valued Gaussians distributed as: 

p(x°|b°,h)~AA((B°)Th,a2l), (6) 
p(h)~AA(0,I), (7) 

where B° — [bi°, b2°, . . . , b„°] is an L x n matrix, and 
without loss of generality we assume that the variance 
of the CSI is equal to one. The expressions for the 
mean and the covariance matrix of the posterior when 



both terms are Gaussians are straightforward ifTSll . These 
expressions can be particularized for our system as: 

hh|p = (I + B°(B°)Ta-2)-iB°a-2x°, (8) 
Ch|p-(I + B°(B°)^a-2)-i, (9) 

where x° = [x^ , xg, . . . , 

2) Produce random samples from the posterior: with the 
vector of means and the covariance matrix, we can 
sample to obtain M random samples. 

3) Solve the BCJR algorithm: the APP estimates of each 
transmitted bit is computed for the M different samples 
of p(h|2?). 

4) Computation of (|4]l: the M different values of p{hi = 
6|a;i, . . . , xat, h) average the APP of each transmitted 
bit over all possible cases of h: 

p(h = b\xi, . . .,XN,D) sa 

1 ^' 

~ J^^Pi^i = ^\^i^---^^N,'iij), (10) 

which yields an approximation of Q. 
As already discussed, if we use these optimal APP esti- 
mates as inputs to a soft-decoder, the system achieves better 
performance, especially in case of inaccurate estimations of the 
CSl, i.e., the length of the training sequence is short compared 
to the number of channel taps. However, this solution is 
computationally demanding, because we have to calculate 
M times the BCJR algorithm, whose complexity increases 
exponentially with L. 

IV. Simulation Results 

To illustrate the performance of the proposed receiver, we 
compare its bit error rate curves to the ones of the ML-BCJR, 
before and after the decoder In all experiments we consider 
the following scenario: 

• Block frames of 500 random bits encoded with a regular 
LDPC code (3,6) of rate 1/2. 

• Up to 10000 frames of 1000 bits are transmitted over the 
channel. 

• Between frames, a training sequence of n random un- 
coded bits is transmitted to estimate the channel. 

• Every frame, and its associated training sequence, is sent 
over the same Rayleigh fading channel. We consider that 
the channel coherence time is greater than the duration of 
the frame, i.e., the channel does not change during this 
time. Furthermore, in our experiments we take the same 
value for the taps of the channel during all transmitted 
frames. For the channel with three taps -L = 3- these 
values are: 

H{z) = 0.3482 + 0.8704Z-1 + 0.34822^^ 
and for L = 6: 

H{z) = 0.1600 + 0.5450z"i - 0.6720^-2 + 
+ 0.2560^-3 + 0.0950z"^ - 0.3890z"^ 



• We consider for the Bayesian estimation a prior with zero 
mean and variance equal to 1 for all the taps. 

We depict in Fig. [4] the BER measured after the ML- 
BCJR (dashed line) and Bayesian (solid line) equalizers for 
the channel with L = 3 and different lengths of the training 
sequence. The differences between both equalizers are negli- 
gible, although the BER of the Bayesian equalizer is always 
lower. 
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Fig. 4. BER performance for Bayesian equalizer (solid lines) and ML-BCJR 
(dashed lines) before the decoder, for a channel with 3 taps and different 
lengths of the training sequence, n = 10 (o), n = 15 (□) and n = 20 (o). 
Dash-dotted line illustrates performance assuming perfect CSI. 

In Fig. |5] and [6] we show the BER after the LDPC decoder 
has corrected the errors introduced by the channel. In these 
plots, we can observe that for short training sequences the 
gains of using the Bayesian equalizer over the ML-BCJR are 
significant (over 1 dB). The difference between the results in 
Fig.|4] and Fig.[5]and[6]can be explained by the APP estimates 
given by each procedure. When we measure the BER after the 
equalizer (Fig. |4]), we only care if each bit has been correctly 
decoded and, consequently, we are only measuring how good 
the APP estimate of the 50% percentile is. When we measure 
the BER after the LDPC decoder (Fig. |5] and |6]l, we need 
that the APP estimates are accurate everywhere, because the 
LDPC decoder needs these estimates to decode correctly the 
transmitted codeword, i.e., the Belief Propagation algorithm 
uses the APP for each individual bit. These results sustain 
our claim that the Bayesian equalizer provides more accurate 
predictions of the APP than the ML-BCJR equalizer, as the 
LDPC decoder is able to better decode with them. 

Particularly, in Fig. |5] we can observe that the Bayesian 
equalizer improves the ML-BCJR equalizer as we increase 
the SNR for a fixed length of the training sequence and both 
equalizers tend to coincide as we increase the length of the 
training sequence. This last result is expected, because as we 
increase the length of the training sequence the posterior for 
the CSI tends to a delta function and the ML-BCJR and 
Bayesian equalizers coincides. But for these equalizers to 
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Fig. 5. BER performance for Bayesian equalizer (solid lines) and ML- 
BCJR (dashed lines) after the LDPC decoder, for a channel with 3 taps and 
different lengths of the training sequence, n = 10 (o), n = 15 (□), n = 20 
(o), ?i = 35 (v) and n = 60 (A). Dash-dotted line illustrates performance 
assuming perfect CSI. 
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Fig. 6. BER performance for Bayesian equalizer (solid lines) and ML-BCJR 
(dashed lines) after the decoder, for a channel with 6 taps and different lengths 
of the training sequence, n = 15 (o), n = 25 (□), n = 40 (o), n = 50 (v) 
and n = 90 (A). Dash-dotted line illustrates performance assuming perfect 
CSI. 



coincide, we need over 60 training samples (more than 20 per 
coefficient) and for shorter training sequences the Bayesian 
equalizer is vastly superior. 

Finally, in Fig. |6] we plot the curves for the channel with 
6 taps. In this experiment, we can see the previous result 
magnified, because the longer the impulsive response of the 
channel is the more uncertain the channel estimate is and the 
higher the room for improvement for the Bayesian equalizer. 
In this experiment, we can observe gains over 1 dB for training 
sequence with 25 symbols and SNR = 7 dB. 



V. Conclusions and Future Work 

Channel equalization has been traditionally solved using 
a discriminative model where the only variable modeled as 
random is the noise. The generative model introduced in 
this paper, where the posterior probability density function 
of the estimated CSI is included, is a more efficient solution. 
If we are to just estimate the encoded transmitted symbols, 
the discriminative model is a good choice. However, if the 
estimation of the APP is needed, i.e., the decoder very much 
benefits from this information, the discriminative solution 
exhibits poor results whenever the CSI is badly estimated. On 
the contrary, the Bayesian approach exploits the full statistical 
model to provide optimal APP. We prove that these estimations 
are useful if a LDPC encoding is used. Other decoders may 
take advantage of this solution as well. 

Despite both the BCJR algorithm and LDPC decoding can 
be computed efficiently using machine learning algorithms 
applied on graphs fTS^, the drawback of this proposal is 
its computational complexity, because we have to compute 
several times the results for the BCJR algorithm to average the 
integral (j4]). Alternative graphical representation or inference 
algorithms that capture the essence of this approach may yield 
sub-optimal but less demanding solution for Bayesian equal- 
ization proposed. Furthermore, other systems and channels can 
be considered. width=2.5in]subfigcasel 
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